APPENDIX 



1 . A voice information registration method, employed by a speech recognition apparatus, 
comprising: 

(a) obtaining a sentence group, which includes a first to an N-th sentence, wherein N is a 
number equal to or greater than two; 

(b) obtaining a sounds-like spelling for a word that is included in an i-th sentence, but is not 
entered in a speech recognition dictionary, wherein i is a number equal to or less than N; 

(c) obtaining a base form based on said sounds-like spelling of said word; and 

(d) registering said base form in a speech recognition dictionary in correlation with said word. 

2. A sentence specification method, employed by a speech recognition apparatus, 
comprising: 

a registration step including: 

(al) obtaining a sentence group, which includes a first to an N-th sentence, wherein N is a 
number equal to or greater than two, 

(a2) obtaining a sounds-like spelling for a word that is included in an i-th sentence, but is not 
entered in a speech recognition dictionary, wherein i is a number equal to or less than N, 

(a3) obtaining a base form based on said sounds-like spelling of said word, and 

(a4) registering said base form in a speech recognition dictionary in correlation with said 
word, and 

a recognition step including: 

(b 1 ) obtaining voice information that is input as a user reads and vocally reproduces a display 
corresponding to said i-th sentence, 

(b2) employing said base form to recognize said voice information and to select a speech 
recognition sentence, and 

(b3) comparing said i-th sentence with said selected speech recognition sentence. 
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3. The sentence specification method according to claim 2, wherein said group of sentences 
is obtained from an application, said method further comprising a step of generating a control 
message corresponding to said i-th sentence and transmitting said control message to said 
application. 

4. The sentence specification method according to claim 2, wherein a sounds-like spelling 
score is stored in correlation with the sounds-like spelling of said word, wherein a pronunciation 
score is stored in correlation with said base form, and wherein, when a function value that is obtained 
by using said sounds-like spelling score and said pronunciation score exceeds a threshold value, said 
base form is registered in a speech recognition dictionary. 

6. A speech recognition apparatus, comprising: 

(a) a sentence specification unit for obtaining a sentence group, which includes a first to an 
N-th sentence, wherein N is a number equal to or greater than two; 

(b) an unknown word detector for obtaining a sounds-like spelling for a word that is included 
in an i-th sentence, but is not entered in a speech recognition dictionary, wherein i is a number equal 
to or less than N; 

(c) a base form generator for obtaining a base form based on said sounds-like spelling of said 
word; and 

(d) a speech recognition dictionary to which said base form is stored in correlation with said 

word. 

7. A speech recognition apparatus, comprising: 

(a) a sentence specification unit for obtaining a sentence group, which includes a first to an 
N-th sentence, wherein N is a number equal to or greater than two; 

(b) an unknown word detector for obtaining a sounds-like spelling for a word that is included 
in an i-th sentence, but is not entered in a speech recognition dictionary, wherein i is a number equal 
to or less than N; 



12 



(c) a base form generator for obtaining a base form based on said sounds-like spelling of said 

word; 

(d) a speech recognition dictionary in which said base form is stored in correlation with said 

word; 

(e) a voice input unit for obtaining voice information that is input as a user reads and vocally 
reproduces a display corresponding to said i-th sentence; and 

(f) a speech recognition engine for employing said base form to recognize said voice 
information and to select a speech recognition sentence; 

wherein said sentence specification unit compares said i-th sentence with said selected speech 
recognition sentence. 

8. The speech recognition apparatus according to claim 7, wherein said sentence 
specification unit obtains said group of sentences from an application, generates a control message 
corresponding to said i-th sentence, and transmits said control message to said application. 

9. The speech recognition apparatus according to claim 7, wherein a sounds-like spelling 
score is stored in correlation with the sounds-like spelling of said word, wherein a pronunciation 
score is stored in correlation with said base form, and wherein, when a function value that is obtained 
by using said sounds-like spelling score and said pronunciation score exceeds a threshold value, said 
base form is registered in a speech recognition dictionary. 

1 1 . A storage medium in which a program for specifying a sentence is stored to be executed 
by a speech recognition apparatus, said program comprising: 

(a) program code for instructing said speech recognition apparatus to obtain a sentence group, 
which includes a first to an N-th sentence, wherein N is a number equal to or greater than two; 

(b) program code for instructing said speech recognition apparatus to obtain a sounds-like 
spelling for a word that is included in an i-th sentence, but is not entered in a speech recognition 
dictionary, wherein i is a number equal to or less than N; 
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(c) program code for instructing said speech recognition apparatus to obtain a base form 
based on said sounds-like spelling of said word; and 

(d) program code for instructing said speech recognition apparatus to register said base form 
in a speech recognition dictionary in correlation with said word. 

12. A storage medium in which a program for specifying a sentence is stored to be executed 
by a speech recognition apparatus, said program comprising: 

(a) program code for instructing said speech recognition apparatus to obtain a sentence group, 
which includes a first to an N-th sentence, wherein N is a number equal to or greater than two; 

(b) program code for instructing said speech recognition apparatus to obtain a sounds-like 
spelling for a word that is included in an i-th sentence, but is not entered in a speech recognition 
dictionary, wherein i is a number equal to or less than N; 

(c) program code for instructing said speech recognition apparatus to obtain a base form 
based on said sounds-like spelling of said word; 

(d) program code for instructing said speech recognition apparatus to register said base form 
in a speech recognition dictionary in correlation with said word; 

(e) program code for instructing said speech recognition apparatus to obtain voice 
information that is input as a user reads and vocally reproduces a display corresponding to said i-th 
sentence; 

(f) program code for instructing said speech recognition apparatus to employ said base form 
to recognize said voice information and to select a speech recognition sentence; and 

(g) program code for instructing said speech recognition apparatus to compare said i-th 
sentence with said selected speech recognition sentence. 

13. The storage medium according to claim 12, wherein said group of sentences is obtained 
from an application, and wherein program code is stored to instruct said speech recognition 
apparatus to generate a control message corresponding to said i-th sentence and to transmit said 
control message to said application. 
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14. The storage medium according to claim 12, wherein a sounds-like spelling score is stored 
in correlation with the sounds-like spelling of said word, wherein a pronunciation score is stored in 
correction with said base form, and wherein, when a function value that is obtained by using said 
sounds-like spelling score and said pronunciation score exceeds a threshold value, said based form 
is registered in a speech recognition dictionary. 
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